Back to Blog

I Published An AI Written Whitepaper For Code That Does Not Exist And Now I Want To Build It

I am revisiting an old repository. It is called ISRM. Infinitely Scalable Recursive Model. I need to be perfectly clear about my contribution. I did nothing. An AI wrote the entire whitepaper. An AI generated the architecture. An AI produced the mathematical proofs. I provided a prompt. I clicked publish. That is the complete extent of my involvement.

Published tables for experiments that never ran is a bold research strategy. I stand by the formatting. I do not stand by the data. The AI generated both.

The Claims Versus The Reality

The abstract promises true inference-time scalability. It claims loss decreases by twenty-five percent. It claims perplexity drops by four point three times. It claims the model extrapolates to K equals infinity. It cites the Banach Fixed-Point Theorem. It includes tables showing stabilization at K equals thirty-two. Every single number is fiction. The AI generated them because I asked for a research paper format. I accepted the output without running a single training step. You can shame me. I deserve it.

The repository lives at CompactAIOfficial/Infinitely-Scalable-Recursive-Model. You can read the whitepaper. You can admire the decay schedules. You can review the contractive mapping citations. You can also notice that the training dynamics section describes stabilization around five point zero. That stabilization exists only in markdown. The AI hallucinated a complete research pipeline. I published it.

The Architecture That Never Ran

The document describes a TinyNetwork with grouped query attention and SwiGLU activations. It outlines a fixed exponential decay schedule that forces convergence. It details latent refinement passes that act as working memory. It explains stateless step embeddings that enable extrapolation. The pseudocode looks elegant. The data flow diagram is beautifully formatted. None of it has ever executed.

# The theoretical loop the AI wrote
y_0 = initial_state
for k in 1..K:
    y_k = y_{k-1} + alpha_k * (f(y_{k-1}, input) - y_{k-1})
output = y_K
# Elegant on paper. Untested in practice. Currently theoretical fiction.

The AI cited Deep Equilibrium Models. It referenced PonderNet. It compared ISRM to Universal Transformers. It wrote a philosophical section about trajectory versus destination. It generated a K equals four anomaly explanation for a spike that never occurred. I contributed zero lines of code. I contributed zero mathematical proofs. I contributed a commit button and excessive optimism.

The Vibecoding Confession

This is what happens when you let a language model write your research. You get a document that reads like a conference submission. You get mathematical notation that looks rigorous. You get a training methodology section with AdamW optimizers and cosine decay schedules. You also get zero functional software. I published tables for experiments that never ran. I accepted the generated architecture because it sounded smart. I never ran it because I got distracted by tiny models that actually train.

Vibecoding produces excellent documentation. It produces questionable implementations. I have both. I need to fix the second one.

Claude Is Trying To Untangle It

I asked Claude to help me actually implement it. It is reading the AI-generated pseudocode. It is trying to reconcile the contractive mapping theory with actual PyTorch tensors. It is asking questions about Lipschitz constraints that the original AI hand-waved with confident markdown. We are both staring at a repository that promises infinite scalability and currently struggles with basic module initialization.

Claude is patient. It suggests corrections. It rewrites the refinement loop. It adds proper gradient handling. It reminds me that theoretical convergence guarantees require actual weight updates. I appreciate the effort. I also appreciate that the model is politely ignoring the fact that I published experimental results for a training run that never happened.

Why I Want To Revisit It

The concept still interests me. Trading inference compute for quality makes sense. Small models that think longer instead of bigger models that think once feels like a valid direction. The recursive refinement loop is elegant. The decay schedule is clever. The latent working memory idea has merit. The AI accidentally stumbled onto a coherent idea while generating academic prose. I want to see if the idea survives contact with reality.

I train tiny models. I care about efficiency. I care about doing more with less. ISRM aligns with that philosophy. It just needs to actually work. That is the new goal. No more fictional tables. No more untested claims. Just code that runs and loss curves that exist.

What Needs To Happen

First I need to strip the whitepaper down to testable components. Second I need to build the TinyNetwork without assuming the math will magically enforce convergence. Third I need to run actual training with dynamic K sampling. Fourth I need to accept that the K equals four anomaly might just be the model collapsing entirely. I will document everything. I will probably break everything. That is the process.

I will start small. I will verify the decay schedule. I will check the gradient flow. I will run K equals one through K equals eight and see if the loss actually decreases. If it does, I will celebrate. If it does not, I will rewrite the refinement step. I will repeat until the architecture behaves like the documentation claims it should.

Final Thoughts

ISRM exists as a repository full of AI-generated confidence and untested code. I wrote none of it. I tested none of it. I published all of it. Now I want to see if the underlying idea actually works. Claude is helping. The GPU is waiting. The math is questionable. The ambition remains.

If it works, I will update the tables with real numbers. If it fails, I will write a blog post about contractive mappings that refused to contract. Both outcomes are educational. Both outcomes are very on brand for me.

You can shame me for the untested whitepaper. You should. I deserve it. I will earn the redemption by making the code run. Or I will fail publicly. Both paths lead to learning. I prefer the first one. I am prepared for the second one.